Bag of Senses Versus Bag of Words: Comparing Semantic and Lexical Approaches on Sentence Extraction

نویسندگان

  • Jorge García Flores
  • Laurent Gillard
  • Olivier Ferret
  • Gaël de Chalendar
چکیده

Sentence extraction is a valuable technique for automatic summarization. This paper presents LIC2M’s first participation in TAC evaluation campaign (update summarization task). We describe two main extractive approaches for summarization. The semantic strategy makes use of a bag-of-senses to calculate sense concentration on each sentence. In the lexical strategy, source text sentences are ranked according to their lexical similarity against a topic statement represented as a bag-of-words. Both approaches are compared and the evaluation results analyzed. An alternative version of the semantic strategy is proposed, where sense concentration ranking takes into account syntactic dependencies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

First Language Activation during Second Language Lexical Processing in a Sentential Context

 Lexicalization-patterns, the way words are mapped onto concepts, differ from one language      to another. This study investigated the influence of first language (L1) lexicalization patterns on the processing of second language (L2) words in sentential contexts by both less proficient and more proficient Persian learners of English. The focus was on cases where two different senses of a polys...

متن کامل

Lexical Access Preference and Constraint Strategies for Improving Multiword Expression Association within Semantic MT Evaluation

We examine lexical access preferences and constraints in computing multiword expression associations from the standpoint of a high-impact extrinsic task-based performance measure, namely semantic machine translation evaluation. In automated MT evaluation metrics, machine translations are compared against human reference translations, which are almost never worded exactly the sameway except in t...

متن کامل

Exploring the Potential of Semantic Relatedness in Information Retrieval

Employing lexical-semantic knowledge in information retrieval (IR) is recognised as a promising way to go beyond bag-of-words approaches to IR. However, it has not yet become a standard component of IR systems due to many difficulties which arise when knowledge-based methods are applied in IR. In this paper, we explore the use of semantic relatedness in IR computed on the basis of GermaNet, a G...

متن کامل

A “bag-of-arguments” mechanism for initial verb predictions

Previous studies have shown that comprehenders use rich contextual information to anticipate upcoming input on the fly, but less is known about how comprehenders integrate different sources of information to generate predictions in real time. The current study examines the time course with which the lexical meaning and structural roles of preverbal arguments impact comprehenders’ lexical semant...

متن کامل

More than Bag-of-Words: Sentence-based Document Representation for Sentiment Analysis

Most sentiment analysis approaches rely on machine-learning techniques, using a bag-of-words (BoW) document representation as their basis. In this paper, we examine whether a more fine-grained representation of documents as sequences of emotionally-annotated sentences can increase document classification accuracy. Experiments conducted on a sentence and document level annotated corpus show that...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008